When Words Sweat: Identifying Signals for Loan Default in the Text of Loan Applications
Abstract
The authors present empirical evidence that borrowers, consciously or not, leave traces of their intentions, circumstances, and personality traits in the text they write when applying for a loan. This textual information has a substantial and significant ability to predict whether borrowers will pay back the loan, above and beyond the financial and demographic variables commonly used in models predicting default. The authors use text-mining and machine-learning tools to automatically process and analyze the raw text in over 120,000 loan requests from Prosper.com, an online crowdfunding platform. The authors find that loan requests written by defaulting borrowers are more likely to include words related to their family, mentions of God, the borrower’s financial and general hardship, pleas to lenders for help, and short-term-focused words. The authors further observe that defaulting loan requests are written in a manner consistent with the writing style of extroverts and liars. Using a counterfactual analysis, the authors demonstrate that applying their findings can yield a 9.7% additional return on investment.
Similar resources
Matrix Sequential Hybrid Credit Scorecard Based on Logistic Regression and Clustering
The Basel II Accord pointed out the benefits of credit risk management through internal models that estimate Probability of Default (PD). Banks use default predictions to estimate loan applicants’ PD. However, in practice, PD is not useful and banks apply credit scorecards in their decision-making process. Also, competitive pressures in the lending industry have forced banks to use profit scorecards...
The Effects of Loan Officers’ Compensation on Loan Approval and Performance: Direct Evidence from a Corporate Experiment
To better understand the role of loan officers in the origins of the financial crisis, we study a controlled experiment conducted by a large bank. In the experiment, the incentive structure of a subset of small business loan officers was changed from fixed salary to commission-based compensation. We use a diff-in-diff design to show that while the characteristics of loan applications did not cha...
Investigating the missing data effect on credit scoring rule based models: The case of an Iranian bank
Credit risk management is a process in which banks estimate the probability of default (PD) for each loan applicant. Data sets of previous loan applicants are built by gathering their data; these internal data sets are usually completed using external credit bureaus’ data and finally used for estimating PD in banks. There is also continuing interest among banks in using rule-based classifiers to b...
An Assessment of Beneficiaries’ Satisfaction of the Management of Loan Contract Components by Farmer Cooperative Societies in Edo State, Nigeria
The study assessed beneficiaries’ satisfaction with the management of loan contract components by cooperatives involved in farm credit delivery in Edo State. The objective was to identify the components of the farm loan contract, examine the management strategies, and rate the beneficiaries’ satisfaction with such management strategies. This was done by purposively selecting 40 cooperatives invo...